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Abstract 

Entropic Dynamics is a framework in which dynamical laws are derived 
as an application of entropic methods of inference. No underlying action 
principle is postulated. Instead, the dynamics is driven by entropy subject 
to the constraints appropriate to the problem at hand. In this paper we 
review three examples of entropic dynamics. First we tackle the simpler 
case of a standard diffusion process which allows us to address the central 
issue of the nature of time. Then we show that imposing the additional 
constraint that the dynamics be non-dissipative leads to Hamiltonian dy¬ 
namics. Finally, considerations from information geometry naturally lead 
to the type of Hamiltonian that describes quantum theory. 


1 Introduction 

The laws of physics, and in particular the laws of dynamics, have traditionally 
been seen as laws of nature. It is usually believed that such laws are discovered 
and that they are useful because they reflect reality. The reflection, imperfect 
though it may be, represents a very direct relation between physics and nature. 
Here we explore an alternative view in which the relation is considerably more 
indirect: The laws of physics provide a framework for processing information 
about nature. From this perspective physical models are mere tools that are 
partly discovered and partly designed with our own very human purposes in 
mind. This approach is decidedly pragmatic: when tools happen to be successful 
we do not say that they are true; we say that they are useful. 

Is there any evidence in support of such an unorthodox view? The answer 
is yes. Indeed, if physics is an exercise in inference then we should expect it 
to include both ontic and epistemic concepts. The ontic concepts are meant 
to represent those entities in nature that are the subject of our interest. They 
include the quantities, such as particle positions and field strengths, that we 
want to predict, to explain, and to control. The epistemic concepts, on the other 
hand, are the tools — the probabilities, the entropies, and the (information) 
geometries — that are used to carry out our inferences. The prominence of these 
epistemic elements strongly suggests that physics is not a mere mirror of nature; 
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instead physics is an inference framework designed by humans for the purpose 
of facilitating their interactions with nature. The founders of quantum theory 

— Bohr, Heisenberg, Born, etc. — were quite aware of the epistemological and 
pragmatic elements in quantum mechanics (see e.g., [T]) but they wrote at a 
time when the tools of quantitative epistemology — the Bayesian and entropic 
methods of inference — had not yet been sufficiently developed. 

Entropic Dynamics (ED) provides a framework for deriving dynamical laws 
as an application of entropic methods. (The principle of maximum entropy 
as a method for inference can be traced to E. T. Jaynes. For a pedagogical 
overview of Bayesian and entropic inference and further references see [5].) In 
ED the dynamics is driven by entropy subject to the constraints appropriate 
to the problem at hand. It is through these constraints that the “physics” is 
introduced. Such a framework is extremely restrictive. For example, in order 
to adopt an epistemic view of the quantum state ■i/' it is not sufficient to merely 
assert that the probability represents a state of knowledge; this is a good 
start but it is not nearly enough. It is also necessary that the changes or updates 
of the epistemic tp — which include both the unitary time evolution described the 
Schrodinger equation and the collapse of the wave function during measurement 

— be derived according to the established rules of inference. Therefore in a truly 
entropic dynamics we are not allowed to postulate action principles that operate 
at some deeper sub-quantum level. Instead, the goal is to derive such action 
principles from entropy principles with suitably chosen constraints. 

In this paper we collect and streamline results that have appeared in sev¬ 
eral publications (see [3]-[6] and references therein) to provide a self-contained 
overview of three types of ED: (I) standard diffusion, (2) Hamiltonian dynamics, 
and (3) quantum mechanics. First we tackle the case of a diffusion process which 
serves to address the central concern with the nature of time. In ED “entropic” 
time is a relational concept introduced as a book-keeping device designed to 
keep track of the accumulation of change. Time is constructed by (a) intro¬ 
ducing the notion of “instants”, (b) showing that these instants are “ordered”, 
and (c) defining a measure of the interval that separates successive instants. 
The welcome new feature is that an arrow of time is generated automatically; 
entropic time is intrinsically directional. 

The early formulation of ED [J involved assumptions about auxiliary vari¬ 
ables, the metric of configuration space, and the form of the quantum potential. 
All these assumptions have been subsequently removed. In [6] it was shown how 
the constraint that the dynamics be non-dissipative leads to a generic form of 
Hamiltonian dynamics, with its corresponding symplectic structure and action 
principle. Thus, in the context of ED action principles are derived] they are 
useful tools but they are not fundamental. 

Different Hamiltonians lead to different dynamical laws. We show how con¬ 
siderations of information geometry provide the natural path to Hamiltonians 
that include the correct form of “quantum potential” and lead to the Schrodinger 
equation, and we also identify the constraints that describe motion in an exter¬ 
nal electromagnetic field. 

Here we focus on the derivation of the Schrodinger equation but the ED 
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approach has been applied to several other topics in quantum mechanics that 
will not be reviewed here. These include the quantum measurement problem 
Emm-. momentum, angular momentum, their uncertainty relations, and spin 
mm-. relativistic scalar fields m-, the Bohmian limit m-, and the extension 
to curved spaces m- 

There is vast literature on the attempts to reconstruct quantum mechanics 
and it is inevitable that the ED approach might resemble them in one aspect or 
another — after all, in order to claim success all these approaches must sooner or 
later converge to the same Schrddinger equation. However, there are important 
differences. For example, the central concern with the notion of time makes ED 
significantly different from other approaches that are also based on information 
theory (see e.g., [ll]-[20]). ED also differs from those approaches that attempt to 
explain the emergence of quantum behavior as the effective statistical mechanics 
of some underlying sub-quantum dynamics which might possibly include some 
additional stochastic element (see e.g., [11]-[1H])- Indeed, ED makes no reference 
to any sub-quantum dynamics whether classical, deterministic, or stochastic. 


2 Entropic Dynamics 

As with other applications of entropic methods, to derive dynamical laws we 
must first specify the microstates that are the subject of our inference — the 
subject matter — and then we must specify the prior probabilities and the con¬ 
straints that represent the information that is relevant to our problem. (See e.g., 
[5].) We consider N particles living in a flat Euclidean space X with metric Sab- 
We assume that the particles have definite positions a:® and it is their unknown 
values that we wish to infer0 (The index n = I... N denotes the particle and 
a = 1,2,3 the spatial coordinates.) For N particles the configuration space is 
Xat = X X ... X X. 

The assumption that the particles have definite positions is in flat contra¬ 
diction with the standard Copenhagen notion that quantum particles acquire 
definite positions only as a result of a measurement. For example, in the ED 
description of the double slit experiment we do not know which slit the quantum 
particle goes through but it most definitely goes through either one or the other. 

We do not explain why motion happens but, given the information that it 
does, our task is to produce an estimate of what short steps we can reasonably 
expect. The next assumption is dynamical: we assume that the particles follow 
trajectories that are continuous. This means that displacements over finite 
distances can be analyzed as the accumulation of many infinitesimally short 
steps and our first task is to find the transition probability density P{x'\x) for 
a single short step from a given initial x G X^v to an unknown neighboring 
x' G X^f. Later we will determine how such short steps accumulate to yield a 

^In this work ED is developed as a model for the quantum mechanics of particles. The 
same framework can be deployed to construct models for the quantum mechanics of fields, in 
which case it is the fields that are objectively “real” and have well-defined albeit unknown 
values. m 
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finite displacement. 

To find P{x'\x) we maximize the (relative) entropy, 

sip,<3i=-yd.'p(x'wiogS^. (1) 

To simplify the notation in all configuration space integrals we write x = da;, 
fpN x' — fij.'^ and so on. Q{x'\x) is the prior probability. It expresses our 
beliefs — or more precisely, the beliefs of an ideally rational agent — before any 
information about the motion is taken into account. The physically relevant 
information about the step is expressed in the form of constraints imposed on 
P(a:'|a;) — this is the stage at which the physics is introduced. 

The prior — We adopt a prior Q{x'\x) that represents a state of extreme 
ignorance: knowledge of the initial position x tells us nothing about x'. Such 
ignorance is expressed by assuming that Q{x'\x)dx' is proportional to the volume 
element in Xjv. Since the space X^r is flat and a mere proportionality constant 
has no effect on the entropy maximization we can set Q{x'\x) = 10 

The constraints — The first piece of information is that motion is continuous 
— motion consists of a succession of infinitesimally short steps. Each individual 
particle n will take a short step from xf to x'f^ = xf + Aa;“ and we require that 
for each particle the expected squared displacement, 

{AxfAx'f)Sab = Kn , {u = 1 . . . N) ( 2 ) 

takes some small value Infinitesimally short steps are obtained by taking 
the limit «;„—>■ 0. We will assume each to be independent of x to reflect 
the translational symmetry of X. In order to describe non-identical particles 
we assume that the value of depends on the particle index n. 

The N constraints (HI) treat the particles as statistically independent and 
their accumulation eventually leads to a completely isotropic diffusion. But 
we know that particles can become correlated and even become entangled. We 
also know that motion is not normally isotropic; once particles are set in motion 
they tend to persist in it. This information is introduced through one additional 
constraint involving a “drift” potential 4>{x) that is a function in configuration 
space, X G Xtv- We impose that the expected displacements (Ax“) along the 
direction of the gradient of ij) satisfy 


N „ , 

{Ax^)dAcf = Y.{Axl)^ = n' , (3) 

n—1 ^ 

^Uniform non-normalizable priors are mathematically problematic. For us this is only 
a mild annoyance that is easily evaded by adopting a physically reasonable normalizable 
prior. The idea is that for a microscopic particle such as an electron a “uniform” prior is 
any distribution that is essentially flat over macroscopic scales. For example we can use a 
Gaussian centered at x with a macroscopically large standard deviation. 
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where capitalized indices such as A = (n, a) include both the particle index and 
its spatial coordinate; Oa = djdx^ = and n' is another small but for 

now unspecified position-independent constant. 

The introduction of the drift potential (j) will not be justified at this point. 
The idea is that we can make progress by identifying the constraints even when 
their physical origin remains unexplained. This situation is not unlike classical 
mechanics where identifying the forces is useful even in situations where their 
microscopic origin is not understood. We do however make two brief comments. 
First, in section 9 we shall see that ED in an external electromagnetic field 
is described by constraints that are formally similar to ©■ There we shall 
show that the effects of the drift potential </> and the electromagnetic vector 
potential Aa are intimately related — a manifestation of gauge symmetry — 
suggesting that whatever </> might be, it is as “real” as Aa- The second comment 
is that elsewhere, in the context of a particle with spin, we will see that the drift 
potential can be given a natural geometric interpretation as an angular variable. 
This imposes the additional condition that the integral of (f) over any closed loop 
is quantized, §d(f) = 2 'kv where v is an integer. 

Maximizing iS[P, Q] in ([T]) subject to the N -\-2 constraints ([2]), © plus 
normalization yields a Gaussian distribution, 


P{x'\x) = iexp V 


Ax'^AxlSab + a'Ax'^-^ 


( 4 ) 


where C = C(x, an, a') is a normalization constant and the Lagrange multipliers 
an and a' are determined from 


dlog(/dan = —Hnl‘2‘ and d\ogC,/da' = k! . (5) 


The distribution P{x'\x) is conveniently rewritten as 


P{x'\x) = — exp 


2 ^ 


an5,b{Axl- {Axl)){Axi- {Axi)) 


( 6 ) 


where Z is & new normalization constant. A generic displacement Aa;“ = x'n — 
Xn can be expressed as an expected drift plus a fluctuation. 


A< = (A<) + A<, where (Ax“) = , (7) 

Cxn 

(AO = 0 and {Aw'^.Awt) =—6^’’ . (8) 

From these equations we can get a first glimpse into the meaning of the mul¬ 
tipliers an and a'. For very short steps, as —>■ oo, the fluctuations become 
dominant: the drift is (Ax®) ^ 0 (q!“^) while Aw“ This implies 

that, as in Brownian motion, the trajectory is continuous but not differentiable. 
In the ED approach a particle has a definite position but its velocity, the tangent 
to the trajectory, is completely undefined. We can also see that the effect of a' is 
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to enhance or suppress the magnitude of the drift relative to the fluctuations — 
a subject that is discussed in detail in [12]. However, for our current purposes 
we can absorb o! into the so far unspecified drift potential, a'4> —t <^, which 
amounts to setting a' = 1. 

3 Entropic time 

The goal is to derive dynamical laws as an application of inference methods 
but the latter make no reference to time. Therefore, additional assumptions are 
needed to specify what constitutes “time” in our framework. 

The foundation to any notion of time is dynamics. We must first identify 
a notion of “instant” that properly takes into account the inferential nature of 
entropic dynamics. Time is then is constructed as a device to keep track of 
change or, more explicitly, of the accumulation of repeated small changes. 

3.1 Time as an ordered sequence of instants 

Entropic dynamics is generated by the short-step transition probability P{x'\x)^ 
eq.®. In the ith step, which takes the system from x = Xi-i to x' = Xi, both 
Xi-i and Xi are unknown. Integrating the joint probability, P{xi,Xi-i), over 
x^-i gives 



(9) 


These equations involve no assumptions; they are true by virtue of the laws of 
probability. However, if P{xi-i) happens to be the probability of different values 
of Xi-i at an “instant” labelled t, then we can interpret P{xi) as the probability 
of values of Xi at the next “instant” which we will label t'. Accordingly, we write 
P{xi-i) = p{x,t) and P{xi) = p{x',t') so that 



( 10 ) 


Nothing in the laws of probability leading to eq.® forces the interpretation pop 
on us — this is an independent assumption about what constitutes time in our 
model. We use eq. oni) to define what we mean by an instant: if the distribution 
p{x,t) refers to one instant t, then the distribution p{x',t') generated by P{x'\x) 
defines what we mean by the “next” instant t'. The iteration of this process 
defines the dynamics: entropic time is constructed instant by instant: p{t') is 
constructed from p(t), p{t") is constructed from p{t'), and so on. 

The inferential nature of the construction can be phrased more explicitly. 
Once we have decided on the relevant information necessary for predicting fu¬ 
ture behavior — the distributions p{x,t) and P{x'\x) — we can say that this 
information defines what we mean by an “instant”. Furthermore, eq. m shows 
that 
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Time is designed in such a way that given the present the future is inde¬ 
pendent of the past. 

An equation such as 113 is commonly employed to define Markovian behavior 
in which case it is sometimes known as the Chapman-Kolmogorov equation. 
Markovian processes are such that once an external notion of time is given, 
defined perhaps by an external clock, the specification of the state of the system 
at time t is sufficient to fully determine its state after time t — no additional 
information about times prior to t is needed. It should be emphasized that 
we do not make a Markovian assumption. We are concerned with a different 
problem: we do not use m to define a Markovian process in an already existing 
background time; we use it to construct time itself. 

3.2 The arrow of entropic time 

The notion of time constructed according to eq. m incorporates an intrinsic 
directionality: there is an absolute sense in which p{x,t) is prior and p{x',t') is 
posterior. To construct the time-reversed evolution we just write 

p{x,t) = Jdx' P{x\x')p{x',t'), (11) 

where according to the rules of probability theory P{x\x') is related to P{x'\x) 
in eq.® by Bayes’ theorem, 

P{x\x') = -^j^^P{x'\x) . (12) 

p[x',t') 

Note that this is not a mere exchange of primed and unprimed quantities: 
P{x'\x) is the Gaussian distribution, eq.®, obtained using the maximum en¬ 
tropy method; in contrast, the time-reversed P{x\x') obtained using Bayes’ 
theorem, eq. (HI, will not in general be Gaussian. The asymmetry between the 
inferential past and the inferential future is traced to the asymmetry between 
priors and posteriors. 

The subject of the arrow of time has a vast literature (see e.g. PU][5D]'). 
The puzzle has been how to explain the asymmetric arrow from underlying 
symmetric laws of nature. The solution offered by ED is that there are no 
symmetric underlying laws and the asymmetry is the inevitable consequence 
of entropic inference. From the point of view of ED the challenge is not to 
explain the arrow of time, but rather the reverse, how to explain the emergence 
of symmetric laws within an intrinsically asymmetric entropic framework. As 
we shall see below the derived laws of physics — e.g., the Schrodinger equation 
— are time-reversible but entropic time itself only flows forward. 

3.3 Duration: a convenient time scale 

To complete the model of entropic time we need to specify the interval At 
between successive instants. The basic criterion is that duration is defined so 
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that motion looks simple. From eqs. © and m for short steps (large a„) the 
motion is dominated by the fluctuations Aw„. Therefore specifying At amounts 
to specifying the multipliers a„ in terms of At. 

To start we appeal to symmetry: in order that the fluctuations (Aui“ AwJ^) 
reflect the symmetry of translations in space and time — a time that just like 
Newton’s flows “equably everywhere and everywhen” — we choose to be 
independent of x and t, and we choose At so that a„ oc 1/At. More explicitly, 
we write 




rrin 1 
77 At ’ 


(13) 


where the proportionality constants have been written in terms of some particle- 
specific constants m„, which will eventually be identified with the particle 
masses, and an overall constant 77 that fixes the units of the 77 i„s relative to 
the units of time and will later be regraduated into h. 

Before discussing the implications of the choice m it is useful to consider 
the geometry of the A-particle configuration space, X^r. 


4 The information metric of configuration space 


We have assumed that the geometry of the single particle spaces X is described 
by the Euclidean metric 6at- We can expect that the A-particle configuration 
space, Xjv = X x ... x X, will also have a flat geometry, but the relative 
contribution of different particles to the metric remains undetermined. Should 
very massive particles contribute the same as very light particles? The answer 
is provided by information geometry. 

To each point x € Xat there corresponds a probability distribution P{x'\x). 
Therefore Xat is a statistical manifold and up to an arbitrary global scale factor 
its geometry is uniquely determined by the information metric. 


lAB 


c 


^ ^ dx^ dx^ 


(14) 


where C is an arbitrary positive constant (see e.g., m)- A straightforward 
substitution of eqs.© and (IT^ into ea. (IT4l) in the limit of short steps (a„ ^ 00 ) 
yields 


7 AB 


Crrin 

r]At 


^nn' ^ab — 


CrUn 

r]At 


Sab ■ 


(15) 


We see that 7 ^^ diverges as At -a 0. The reason for this is not hard to find. 
As the Gaussian distributions P{x'\x) and P{x'\x + Ax) become more sharply 
peaked and it is easier to distinguish one from the other which translates into 
a greater information distance, ■jab 00 . In order to define a distance that 
remains meaningful for arbitrarily small At it is convenient to choose C oc At. 
In what follows the metric tensor will always appear in combinations such as 
7 . 45 At/C. It is therefore convenient to define the “mass” tensor. 


pAt 

rriAB = —^JAB = rrinOAB 


(16) 





Its inverse. 


(17) 


^ab 


C 

rjAt 


.AB 



mn 


is called the “diffusion” tensor. 

We can now summarize our results so far. The choice (fT3l) of the multipliers 
an simplifies the dynamics: P{x'\x) in (|6]) is a standard Wiener process. A 
generic displacement, eq. 0, is 


Ax^ = b^At + , 

where b"^{x) is the drift velocity, 

{Ax^) = b^At with b^ = = r]m^^dB(t> , 

rrin 

and the fluctuations Aw^ are such that 

(Aw^) = 0 and (Aw^Aw^) = -^6^^At = ■qm^^At . 

mn 


(18) 

(19) 

( 20 ) 


We are now ready to comment on the implications of the choice of time 
scale At and of multipliers eq. (US- The first remark is on the nature of 
clocks: In Newtonian mechanics the prototype of a clock is the free particle 
and time is defined so as to simplify the motion of free particles — they move 
equal distances in equal times. In ED the prototype of a clock is a free particle 
too — for sufficiently short times all particles are free — and time here is also 
defined to simplify the description of their motion: the particle undergoes equal 
fluctuations in equal times. 

The second remark is on the nature of mass. The particle-specific constants 
m„ will, in due course, be called ‘mass’ and ea. (l20l) provides their interpretation: 
mass is an inverse measure of fluctuations. Thus, up to overall constants the 
metric of configuration space is the mass tensor and its inverse is the diffusion 
tensor. In standard QM there are two mysteries: “Why quantum fluctuations?” 
and “What is mass?”. ED offers some progress in that instead of two mysteries 
there is only one. Fluctuations and mass are two sides of the same coin. 

Finally we note the formal similarity to Nelson’s stochastic mechanics [22]. 
The similarity is to be expected — all theories that converge on the Schrodinger 
equation must at some point become formally similar — but our epistemic 
interpretation differs radically from Nelson’s ontic interpretation and avoids the 
difficulties discussed in ED- 


5 Diffusive dynamics 

Equation m is the dynamical equation for the evolution oi p{x,t). It is written 
in integral form but it can be written in differential form as a Fokker-Planck 
(FP) equation (see e.g., [5]) 

dtp = -Oa {b^p) + ^qm^^dAdBP , (21) 
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or equivalently as a continuity equation, 

dtp = -Oa {pv^) , ( 22 ) 

where is the velocity of the probability flow or current velocity^ 

v^ = b^ + u^ and = -pm^^dB\ogp^/‘^ (23) 

is the osmotic velocity — it represents the tendency for probability to flow down 
the density gradient. Since both and are gradients, it follows that the 
current velocity is a gradient too, 

v^ = m^^dB^ where <f> = 77((/) — logp^^^) . (24) 

The FP equation 

dtp = -dA (pm^^dB^) , (25) 

can be conveniently rewritten in the alternative form 

(26) 

for some suitably chosen functional H[p, $]. It is easy to check that the appro¬ 
priate functional H is 

H[p, d>] = J dx ^pm^^dA^dB^ + F[p] , (27) 

where the integration constant F[p] is some unspecified functional of p. 

With these results we have demonstrated that a specific form of dynamics 
— a standard diffusion process — can be derived from principles of entropic 
inference. This diffusive dynamics can be written in different but equivalent 
ways — equations eq. dan, m. (123, and (l26ll are all equivalent. Next we turn 
our attention to other forms of dynamics such as quantum or classical mechanics 
which require a somewhat different choice of constraints. 

6 Hamiltonian dynamics 

The previous discussion has led us to a standard diffusion in which the density p 
evolves under the influence of some externally fixed drift potential (j). However, 
in quantum dynamics we require a second degree of freedom, the phase of the 
wave function. The extra degree of freedom is introduced into ED by replacing 
the constraint of a hxed drift potential (/) by an evolving constraint in which at 
each time step the potential (j) is readjusted in response to the evolving p. 

To find the appropriate readjustment of (/) we borrow an idea of Nelson’s 
|32] and impose that the potential (f) be updated in such a way that a certain 
functional, later called “energy”, remains constant. The next challenge is to 
identify the appropriate functional form of this energy but before this we make 
two remarks. 
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The standard procedure in mechanics is to derive the conservation of energy 
from the invariance of the action under time translations but here we do not 
have an action yet. The logic of our derivation runs in the opposite direction; 
we first identify the conservation of an energy as the piece of information that is 
relevant to our inferences and from it we derive Hamilton’s equations and their 
associated action principle. 

Imposing energy conservation appears to be natural because it agrees with 
our classical preconceptions of what mechanics is like. But ED is not at all 
like classical mechanics. Indeed, eg. (ITS)) is the kind of equation (a Langevin 
equation) that characterizes a Brownian motion in the limit of infinite friction. 
Therefore in the ED approach to quantum theory particles seem to be subject 
to infinite friction while suffering zero dissipation. Such a strange dynamics can 
hardly be called ‘mechanics’ much less ‘classical’. 


The ensemble Hamiltonian — The energy functional that codifies the cor¬ 
rect constraint is of the form (EZl). We therefore impose that, irrespective of 
the initial conditions, the potential (j) will be updated in such a way that the 
functional H[p, $] in (l?7l) is always conserved. 


dH 

dt 


f , 


/ ax 

—-h -^dtp 
(5$ 6p 


Using eg. (05)) we get 


did 

dt 



dt^ + ^ 

Sp 


dtp = 0 . 


(28) 


(29) 


We require that H = const, for arbitrary choices of the initial values of p and 
$. Erom eg. (1251) we see that this amounts to imposing dH/dt = 0 for arbitrary 
choices of dtp. Therefore the requirement that H be conserved for arbitrary 
initial conditions amounts to imposing that 


■ (30) 

dp 

Equations (IMl) and (IdHll have the form of a canonically conjugate pair of Hamil¬ 
ton’s equations. The conserved functional H[p, d>] in (l27l) will be called the 
ensemble Hamiltonian. We conclude that non-dissipative ED leads to Hamilto¬ 
nian dynamics. 


The action, Poisson brackets, etc. — The field p is a generalized co¬ 
ordinate and $ is its canonical momentum. Eo. ()30p leads to a generalized 
Hamilton-Jacobi equation, 

1 

dt^ = --m^^dA^dB^ - -j- ■ (31) 

2 dp 


11 






Now that we have Hamilton’s equations, (uni) and O, we can invert the usual 
procedure and construct an action principle from which they can be derived. 
Define the differential 


SA = J dt J dx 

and then integrate to get the action 




A[p, $] = / dt I / dx^p — H[p, $] 


(32) 


(33) 


By construction, imposing dH = 0 leads to (1^ and (1501) . 

The time evolution of any arbitrary functional f[p, <&] is given by a Poisson 
bracket, 


dt 




'sf 5H 

Sf 6H 

6p (54) 

(54) Sp 



{f,H} , 


(34) 


which shows that the ensemble Hamiltonian H is the generator of time evolution. 
Similarly, under a spatial displacement e“ the change in f[p, $] is 


4/[p,$] = {/,Ae“} 


(35) 


where 


Pn 


/ 


d^^xpj2 

n 


d<^ 

dx^ 



(36) 


is interpreted as the expectation of the total momentum, and are the coor¬ 
dinates of the center of mass. 




1 

M 




(37) 


A Schrodinger-like equation — We can always combine p and <1> to define 
the family of complex functions. 


4'fe = exp(ifc$/p). 


(38) 


where k is some arbitrary positive constant. Then the two coupled equations 
dMl) and (|30l) can be written as a single complex Schrodinger-like equation. 


■ V. 


1 




AB ^AdB\'^k\ 

- 7T7 - 

2 P 


'i'k 


Sp 


'i'k 


(39) 


The reason for the parameter k will become clear shortly, but even at this stage 
we can already anticipate that p/k will play the role of ft. 
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7 Information geometry and the Quantum Po¬ 
tential 

Different choices of the functional F[p] in (l?7l) lead to different dynamics. Earlier 
we invoked information geometry, eq. (HI, to define the metric tuab induced in 
configuration space by the transition probabilities P{x'\x). To motivate the 
particular choice of the functional F[p] that leads to quantum theory we appeal 
to information geometry once again. 

Consider the family of distributions p{x\6) that are generated from a distri¬ 
bution p{x) by pure translations by a vector 9^, p{x\6) = p{x — 9). The extent 
to which p{x\9) can be distinguished from the slightly displaced p{x\9 + d9) or, 
equivalently, the information distance between 9^ and 9^ d9 ^, is given by 



d^^ = gABd9^d9^ 

(40) 

where 

gAB{0) = J 

1 dp{x — 9) dp{x — 9) 

'^^p{x-9) 904 d9B 

(41) 

Changing variables x — 9 ^ 

X yields 


gAB{0) = 

f ^ 1 dp{x) dp{x) ^ r ^ 

/ = ^-**1 ^ 

(42) 


The functional F[p] — The simplest choice of functional F[p] is linear in p, 
F[p\ = J dxpV, where V{x) is some function that will be recognized as the 
familiar scalar potential. Since ED aims to derive the laws of physics from a 
framework for inference it is natural to expect that the Hamiltonian might also 
contain terms that are of a purely informational nature. We have identified two 
such tensors: one is the information metric of configuration space jab oc mAB, 
another is Iab[p\- The simplest nontrivial scalar that can be constructed from 
them is the trace Iab- This suggests 

F[p] = ^tti^^IabIp] + J dxpV , (43) 

where ^ > 0 is a constant that controls the relative strength of the two contribu¬ 
tions. The term Iab is sometimes called the “quantum” or the “osmotic” 
potential!! Erom ea. d^ we see that m^^lAB is a contribution to the energy 
such that those states that are more smoothly spread out tend to have lower 
energy. The case ^ < 0 leads to instabilities and is therefore excluded; the case 
^ = 0 leads to a qualitatively different theory and will be discussed elsewhere. 

m 

With this choice of F[p\ the generalized HJ equation becomes 

- = ]-m^^dA^dB^ + 14 - . (44) 

^The relation between the quantum potential and the Fisher information was pointed out 
in [33]. 


13 






8 The Schrodinger equation 


Substituting eq. (l43t into (l3^ gives a Schrodinger-like equation, 


= --^m^^dAdB'i’k + V'iik + 


2 fc2 


— 4^ I TO 


j^g dAdsl'^kl 

l^fcl 


, ( 45 ) 


the beauty of which is severely marred by the non-linear last term. 


Regraduation — We can now make good use of the freedom afforded by the 
arbitrary constant k. Since the physics is fully described by p and $ the different 
choices of A: in 'I'fc all describe the same theory. Among all these equivalent 
descriptions it is clearly to our advantage to pick the k that is most convenient 
— a process usually known as ‘regraduation’ □ 

A quick examination of ea. (l45|) shows that the optimal k is such that the 
non-linear term drops out. The optimal choice, which we denote k, is 

k = . (46) 

We then identify the optimal regraduated ri/k with Planck’s constant h, 

k 

and eg. (1451) becomes the linear Schrodinger equation, 

fc2 _t2 

ihdt'^ = — —m^^dAdB'^ + V'^ = , 

2 2 to „ 

n 

where the wave function is 'k = pe''^!^. The constant ^ vn ea. (l4^ 

turns out to be crucial: it defines the numerical value of what we call Planck’s 
constant and sets the scale that separates quantum from classical regimes. 

The conclusion is that for any positive value of the constant ^ it is always 
possible to regraduate 4'^ to a physically equivalent but more convenient de¬ 
scription where the Schrodinger equation is linear. From the ED perspective 
the linear superposition principle and the complex Hilbert spaces are important 
because they are convenient but not because they are fundamental. 


(47) 

(48) 


9 ED in an external electromagnetic field 

In ED the information that is physically relevant for prediction is codified into 
constraints that reflect that motion is (a) continuous, (b) correlated and direc¬ 
tional, and (c) non-dissipative. These constraints are expressed by eqs.([2]), ^ 

“^Other notable examples of regraduation include the Kelvin choice of absolute temperature, 
the Cox derivation of the sum and product rule for probabilities, and the derivation of the 
sum and product rules for quantum amplitudes. m na 
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and (1501) respectively. In this section we show that interactions can be intro¬ 
duced by imposing additional constraints. 

As an explicit illustration we show that the effect of an external electromag¬ 
netic field is modelled by a constraint on the component of displacements along 
a certain direction represented by the vector potential For each particle 

n we impose the constraint 

(Aa;“)Aa(x„) = k" , {n = l...N) (49) 


where k" is a particle-dependent constant that reflects the strength of the cou¬ 
pling to Aa. 

The resemblance between ((4^ and the drift potential constraint, eq.dS]), is 
very significant — as we shall see shortly it leads to gauge symmetries — but 
there also are significant differences. Note that (|31) is a single constraint acting 
in the A-particle configuration space — it involves the drift potential (/)(x) with 
X € Xw. In contrast, are N constraints acting in the I-particle space — 
the vector potential Aa(xn) is a function in 3D space, Xn S X. 

Except for minor changes the development of ED proceeds as before. The 
transition probability P{x'\x) that maximizes the entropy 5[P, Q], eq.([T|), sub¬ 
ject to dSD, (gSl) and normalization, is 


P{x'\x) = ^exp-^ 


ia„(5a&Aa;“ “ ct"Aa{xn) ) Ax'. 


(50) 


which includes an additional set of Lagrange multipliers a". Next use eqs. m 
and m to get 


P{x'\x) = - exp — 


2r]At 


mAsAx^Ax^ — ( dA4> — —Aa ) Aa; 


(51) 


where we have absorbed a' into (j), a'(j) —>■ </), and written 


Aa^x) = r]a”Aaixn) (52) 

as a vector in configuration space. As in eq. (1181) a generic displacement Aa:"^ can 
be expressed in terms of a expected drift plus a fluctuation, Aa:"^ = b"^At+Aw"^, 
but the drift velocity now includes a new term, 

{r]dA(l) - Aa) ■ (53) 

The fluctuations Aw"^, eq.dSO]), remain unchanged. 

A very significant feature of the transition probability P{x'\x) is its invari¬ 
ance under gauge transformations, 

d(l> ^ ^ d(t) ^ I dx{x„) 

9a;“ 9a;“ 9a;“ a' dx^ 

Aa{Xn) Aa{Xn) = Aa{Xn) + ^ 
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Note that these transformations are local in space. (The vector potential Aq(x„) 
and the gauge function xi^n) are functions in space.) They can be written in 
the A^-particle configuration space, 

^ -b-X , 

V 

(56) 

Aa Aa = Aa -b Oax ■ 

(57) 

where 

X{x) = V x(xn) ■ 

(58) 


The accumulation of many small steps is described by a Fokker-Planck equa¬ 
tion which can be written either as a continuity equation, ea. p 2 p . or in its 
Hamiltonian form, eq. (IMl)- As might be expected, the current velocity 
ea. (l24l) . and the ensemble Hamiltonian H, ea. (l27l) . must be suitably modified, 

— Ab) with $ = 77 ( 1 ^ — log, (59) 


and 


h[pM 



\p^ 


AB 


— Aa) — Ab) -b pV + 



OapObP 


(60) 

As a shortcut here we have adopted the same functional F[p] motivated by 
information-geometry, ea. (H51) . and set ^ The new FP equation now 

reads. 


dtp = -Oa [pm^^ ( 9 b $ - Ab)] 


( 61 ) 


The requirement that H be conserved for arbitrary initial conditions amounts to 
imposing the second Hamilton equation, ea. ([30|) . which leads to the Hamilton- 
Jacobi equation. 


= —m 
2 


AB 


(9^$ — Aa) (Ob^ ~ Ab) V — 


-m 


AB 




All 


(62) 


Finally, we combine p and <i> into a single wave function, to obtain 

the Schrodinger equation, 

t2 

= — —m^^{dA - ^AA)(dB - + V'i>k ■ (63) 

2 a a 

We conclude with two comments. First, the covariant derivative 


% 0 i 

DA = dA - j^Aa = - -pa'Mxn) 


(64) 


can be written in the standard notation. 


( 65 ) 
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where e„ is the electric charge of particle n in units where c is the speed of light. 
Comparing (l64l) with (1651) allows us to interpret the Lagrange multipliers 




77c 


( 66 ) 


in terms of the electric charges Cn and the speed of light c. Thus, in ED electric 
charge Cn is essentially a Lagrange multiplier a" that regulates the response to 
the external electromagnetic potential Aa- 

The second comment is that the derivation above is limited to static external 
potentials, dtV = 0 and dtAa = 0, so that energy is conserved. This limitation is 
easily lifted. For time-dependent potentials the relevant energy condition must 
take into account the work done by external sources: we require that the energy 
increase at the rate 


dH _ dH 
dt dt 


(67) 


The net result is that equations (EH), EH and EH remain valid for time- 
dependent external potentials. 


10 Some remarks and conclusions 

Are there any new predictions? Our goal has been to derive dynamical 
laws and in particular quantum theory as an example of entropic inference. This 
means that to the extent that we succeed and derive quantum mechanics and 
not some other theory we should not expect predictions that deviate from those 
of the standard quantum theory — at least not in the non-relativistic regime 
discussed here. However, the motivation behind the ED program lies in the 
conviction that it will eventually allow us to extend it to other realms, such as 
gravity or cosmology, where the status of quantum theory is more questionable. 

The Wallstrom objection — An important remaining question is whether 
the Fokker-Planck and the generalized Hamilton-Jacobi equations, eas. d^Sl) and 
EH, are fully equivalent to the Schrodinger equation. This point was first 
raised by Wallstrom in the context of Nelson’s stochastic mechanics [22] and 
concerns the single- or multi-valuedness of phases and wave functions. Briefly 
the objection is that stochastic mechanics leads to phases $ and wave functions 
'k that are either both multi-valued or both single-valued. Both alternatives are 
unsatisfactory: quantum mechanics forbids multi-valued wave functions, while 
single-valued phases can exclude physically relevant states {e.g., states with non¬ 
zero angular momentum). Here we do not discuss this issue in any detail except 
to note that the objection does not apply once particle spin is incorporated into 
ED. As shown by Takabayasi |3S] a similar result holds for the hydrodynamical 
formalism. The basic idea is that, as mentioned earlier, the drift potential (j) 
should be interpreted as an angle. Then the integral of the phase over 
a closed path gives precisely the quantization condition that guarantees that 
wave functions remain single-valued even for multi-valued phases. 
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Epistemology vs. ontology — Dynamical laws have been derived as an ex¬ 
ample of entropic dynamics. In this model “reality” is reflected in the positions 
of the particles, and our “limited information about reality” is represented in the 
probabilities as they are updated to reflect the physically relevant constraints. 

Quantum non-locality — ED may appear classical because no “quantum” 
probabilities were introduced. But this is not so. Probabilities, in this approach, 
are neither classical nor quantum; they are tools for inference. Phenomena that 
would normally be considered non-classical, such as non-local correlations, are 
the natural result of including the quantum potential term in the ensemble 
Hamiltonian. 

On dynamical laws — Action principles are not fundamental; they are conve¬ 
nient ways to summarize the dynamical laws derived from the deeper principles 
of entropic inference. The requirement that an energy be conserved is an im¬ 
portant piece of information (be., a constraint) which will probably receive its 
full justification once a completely relativistic extension of entropic dynamics 
to gravity is developed. 

On entropic vs. physical time — The derivation of laws of physics as 
examples of inference led us to introduce the informationally motivated notion 
of entropic time which includes assumptions about the concept of instant, of 
simultaneity, of ordering, and of duration. It is clear that entropic time is useful 
but is this the actual, real, “physical” time? The answer is yes. By deriving 
the Schrodinger equation (from which we can obtain the classical limit) we have 
shown that the t that appears in the laws of physics is entropic time. Since these 
are the equations that we routinely use to design and calibrate our clocks we 
conclude that what clocks measure is entropic time. No notion of time that is in 
any way deeper or more “physical” is needed. Most interestingly, the entropic 
model automatically includes an arrow of time. 
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